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Introduction 

The  concept  of  grid  computing  -  back-room  computational 
resources  that  are  as  accessible  and  available  as  the  electric 
grid  -  has  gained  momentum  [1].  Numerous  Grid 
computing  projects  such  as  NetSolve  [2]  and  Legion  [3] 
have  provided  infrastructure  to  enable  the  launching  and 
monitoring  of  mostly  parameter  sweep  applications.  These 
Grid  computing  projects  demand  that  users  endure  a  steep 
learning  curve  to  program  and  use  the  system.  Also,  these 
systems  draw  a  strong  distinction  between  the  users’ 
computers  and  the  grid  computing  resources:  users’  work  is 
done  on  their  computers,  while  grid  jobs  are  executed  on 
grid  resources.  A  computational  power  grid  should  have 
characteristics  similar  to  the  electric  power  grids:  always 
available,  ubiquitous  throughout  the  organization,  and  easy 
to  use. 

The  goal  of  the  MIT  Lincoln  Laboratory  Grid  (LLgrid) 
project  is  to  develop  a  On-Demand  Grid  Computing 
capability  to  address  these  characteristics  and  use 
MATLAB®  -  the  dominant  programming  language  for 
implementing  numerical  computations,  widely  used  for 
algorithm  development,  simulation,  data  reduction,  testing, 
and  system  evaluation  -  as  its  initial  target  application  [4]. 
MIT  Lincoln  Laboratory  has  over  one  thousand  MATLAB 
users;  nearly  two  hundred  users  run  very  long  jobs  that 
could  benefit  from  parallel  processing.  The  LLgrid  project 
has  developed  three  technologies  that  allow  these  users  to 
run  parallel  MATLAB  jobs  transparently  on  the  LLgrid 
computational  resources: 

•  MatlabMPI  for  point-to-point  messaging; 

•  pMatlab  for  global  array  semantics  (similar  to  High 
Performance  Fortran);  and 

•  gridMatlab  for  integrating  user’s  computers  into  the 
LLgrid  and  automatically  allocating  grid  computing 
resources. 

These  technologies  have  combined  to  create  a  unique  on- 
demand,  interactive  Grid  Computing  experience,  whereby 
running  a  parallel  MATLAB  job  on  LLgrid  is  identical  to 
running  MATLAB  on  the  desktop.  Users  can  use  LLgrid 
from  Windows,  Linux,  Solaris,  and  Mac  OS  X  computers 
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with  their  desktop  computer  becoming  a  personal  node  in 
the  LLgrid  thereby  establishing  a  transparent  interface 
between  the  user’s  computer  and  the  grid  resources.  LLgrid 
is  enabling  faster  algorithm  development,  prototyping,  and 
validation  cycles  for  Lincoln  staff.  In  addition,  the  initial 
creation  and  setup  of  user  accounts  is  entirely  automated, 
which  minimizes  system  administration. 

The  LLgrid  System 

A  number  of  components  comprise  the  LLgrid  On-Demand 
Grid  Computing  system,  from  the  underlying  hardware  and 
network  to  MATLAB  and  the  three  Lincoln-developed 
MATLAB  toolboxes:  pMatlab,  MatlabMPI,  and 
gridMatlab.  Figure  1  shows  a  conceptual  depiction  of  the 
initial  operational  system  including  the  LLgrid  Alpha 
Cluster  of  Red  Hat  Linux  nodes,  the  network  configuration, 
the  users’  computers  on  the  central  Lincoln  local  area 
network  (LLAN),  and  the  gridsan  network  storage,  which 
delivers  a  common  file  system  to  the  LLgrid  cluster  and  all 
of  the  users. 


Figure  1:  Conceptual  diagram  of  LLgrid  System. 

The  core  of  the  users’  experience  with  the  LLgrid  system 
occurs  with  MATLAB  and  the  pMatlab  toolbox,  while  the 
MatlabMPI  toolbox  provides  the  means  for  parallel 
MATLAB  processes  to  communicate  with  each  other  and 
the  gridMatlab  toolbox  provides  the  interface  between 
MatlabMPI  and  the  LLgrid  resources  for  managing  jobs. 
MatlabMPI  [5]  consists  of  a  set  of  MATLAB  scripts  that 
implement  a  subset  of  MPI,  allowing  any  MATLAB 
program  to  be  run  on  a  parallel  computer.  The  key 
innovation  of  MatlabMPI  is  that  it  implements  the  widely 
used  MPI  “look  and  feel”  on  top  of  standard  MATLAB  file 
I/O,  resulting  in  a  “pure”  MATLAB  implementation  that  is 
exceedingly  small  (-300  lines  of  code).  Thus,  MatlabMPI 
will  run  on  any  combination  of  computers  that  MATLAB 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

01  FEB  2005 

2.  REPORT  TYPE 

N/A 

3.  DATES  COVERED 

4.  TITLE  AND  SUBTITLE 

LLgrid:  Enabling  On-Demand  Grid  Computing  With  gridMatlab  and 
pMatlab 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS (ES) 

MIT  Lincoln  Laboratory,  Lexington,  MA  02420 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS  (ES) 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release,  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

See  also  ADM001742,  HPEC-7  Volume  1,  Proceedings  of  the  Eighth  Annual  High  Performance  Embedded 
Computing  (HPEC)  Workshops,  28-30  September  2004. ,  The  original  document  contains  color  images. 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF: 

17.  LIMITATION  OF 
ABSTRACT 

uu 

18.  NUMBER 
OF  PAGES 

29 

19a.  NAME  OF 
RESPONSIBLE  PERSON 

a.  REPORT 

unclassified 

b.  ABSTRACT 

unclassified 

c.  THIS  PAGE 

unclassified 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


supports.  Overlayed  on  MatlabMPI,  pMatlab  combines 
operator  overloading  with  parallel  maps  to  provide  implicit 
data  parallelism  and  task  parallelism  [6].  pMatlab  allows  a 
MATLAB  user  to  parallelize  their  program  by  changing  a 
few  lines. 

The  gridMatlab  toolbox  transparently  integrates  the 
MATLAB  on  each  user’s  desktop  with  shared  grid  clusters 
through  a  cluster  resource  manager;  when  a  MatlabMPI  or 
pMatlab  job  is  run  by  the  user  in  his  or  her  MATLAB 
session,  gridMatlab  automatically  amasses  the  requested 
LLgrid  computational  resources  from  the  shared  grid 
resources  to  process  in  parallel  with  the  user’s  MATLAB 
session.  (In  traditional  grid  computing  the  users  must 
submit  their  jobs  to  batch  job  queues  where  the  jobs 
execution  must  wait  to  be  executed.)  Using  MatlabMPI,  the 
underlying  common  file  system  (hosted  on  gridsan) 
becomes  the  communication  fabric  through  which  each  of 
the  parallel  MATLAB  processes  communicate,  including 
the  MATLAB  on  each  user’s  desk.  By  integrating  the  user’s 
MATLAB  session  into  the  set  of  grid  cluster  MATLAB 
sessions  working  on  his  or  her  code,  the  user  receives 
immediate  feedback  on  the  status  of  his  or  her  job,  thereby 
making  parallel  MATLAB  execution  virtually  identical  to 
running  MATLAB  on  a  single  computer. 

Results 

As  with  any  distributed,  parallel  computing  system,  it  is 
expected  to  deliver  high  performance;  the  LLgrid  system 
meets  those  expectations.  For  example,  for  a  hyper-spectral 
imaging  (HSI)  analysis  application  [7],  LLgrid  is  used  to 
classify  various  characteristics  in  HSI  signal  returns  using  a 
normal  compositional  model  (NCM).  The  application  is 
composed  of  three  components:  abundance  estimates  of 
class  at  each  pixel  (upabund),  NCM  class  parameter 
updates  (up  NCM),  and  NCM  log-likelihood  computations 
for  given  class  abundance  values  (up  mll).  Figure  2  shows 
the  speedup  of  these  three  application  components,  each 
realizing  near  linear  speedup. 


Figure  2:  Speedup  realized  by  Hyperspectral  Analysis 
application 

When  it  comes  to  modeling  productivity,  we  turned  to  the 
Darpa  IPTO  High  Productivity  Computing  Systems  (HPCS) 
program.  One  of  the  challenges  in  the  HPCS  mission  is 
defining,  modeling,  and  measuring  the  productivity  that  a 
computer  system  delivers  to  its  users  and  consequently  to 
the  users’  organization.  To  answer  this  challenge,  the  HPCS 


Productivity  Team  has  developed  a  high  performance 
productivity  framework  and  evaluation  model  [8].  Their 
model  implies  a  profoundly  different  way  of  viewing  HPC 
systems  by  including  the  user-associated  costs  in  the  model 
and  by  viewing  innovative  hardware  as  a  key  aspect  to 
lowering  the  very  high  cost  of  high  performance  computing 
software. 
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Figure  3:  Range  of  Return  of  Investment  for  sample  LLgrid 
applications. 

Using  the  HPCS  productivity  model,  Figure  3  plots  two 
return  on  investment  (ROI)  values  for  the  LLgrid  system 
and  nine  sample  applications  that  are  currently  being  run  on 
the  LLgrid  system,  it  shows  a  potential  upper  and  lower 
bounds  by  calculating  the  ROI  for  average  programming 
rates  of  2  and  4  SLOCs  per  hour.  The  graph  shows  that  the 
ROI  range  is  between  2.6  and  4.6  across  the  nine  sample 
applications  and  average  programming  rates,  which  means 
that  for  every  $1.00  spent  on  the  LLgrid,  between  $2.60  and 
$4.60  is  returned  in  user  time  saved. 
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Example  App:  Prototype  GMTI  &  SAR  Signal  Processing 


Streaming 
Sensor 
Data 


Research 

Sensor 


On-board 

processing 


SAR 

GMTI 


(new) 


Real-time  front-end  processing 


Non-real-time  GMTI  processing 


LLgrid-HPEC-04  -3 
AIR  29-Sep-04 


MIT  Lincoln  Laboratory 


LLGrid 


7 


A 


Goal:  To  develop  a  grid  computing  capability  that  makes  it  as  easy  to 
run  parallel  Matlab  programs  on  grid  as  it  is  to  run  Matlab  on  own 
workstation. 
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Lab  Grid  Computing  Components 

*  Enterprise  access  to  high  throughput  Grid  computing 

*  Enterprise  distributed  storage 
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Conducted  survey  of  Lab  staff 

-  Do  you  run  long  MATLAB  jobs? 

-  How  long  do  those  jobs  run  (minutes, 
hours,  or  days)? 

-  Are  these  jobs  unclassified,  classified, 
or  both? 

Survey  results: 

-  464  respondents 

-  177  answered  “Yes”  to  question  on 
whether  they  run  long  jobs 

Lincoln  MATLAB  users: 

-  Engineers  and  scientists, 
generally  not  computer  scientists 

-  Little  experience  with  batch  queues, 
clusters,  or  mainframes 

-  Solution  must  be  easy  to  use 
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•  Easy  to  set  up 

-  First  time  user  setup  should 
be  automated  and  take  less 
than  10  minutes 

•  Easy  to  use 

-  Using  LLGrid  should  be  the 
same  as  running  a  MATLAB 
job  on  user’s  computer 

•  Compatible 

-  Windows,  Linux,  Solaris,  and 
MacOS  X 

•  High  Availability 

•  High  Throughput  for  Medium 
and  Large  Jobs 
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Processors  used  by  Job 


Allowing  Lincoln  staff  to  effectively 
use  parallel  computing  daily  from 
their  desktop 

-  Interactive  parallel  computing 

-  160  CPUs,  25  Users,  11  Groups 

Extending  the  current  space  of  data 
analysis  and  simulations  that 
Lincoln  staff  can  perform 

-  Jobs  requiring  rapid  turnaround 

Appl:  Weather  Radar  Signal 
Processing  Algorithms 

-  Jobs  requiring  many  CPU  hours 

App2:  Hyperspectral  Image  Analysis 
App3:  Laser  Propagation  Simulation 


>8  CPU  hours  -  Infeasible  on  Desktop 

>8  CPUs  -  Requires  On-Demand  Parallel  Computing 


3500  jobs,  3600  CPU  Days 
December  03  -  June  04 
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Key  Innovations: 

pMatlab  -  Global  array  semantics  for  parallel  MATLAB 
gridMatlab  -  User’s  computer  is  transparently  included  into  LLGrid 

-  User  never  logs  into  LLGrid  (only  mounts  file  system)  J 
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Alpha  Grid  Hardware 

80  Nodes  +  Head  Node  - 160+2  Processors,  320  GB  RAM 


•Dual  2.8  &  3.06  GHz  Xeon  (P4) 
•400  &  533  MHz  front-side  bus 
•4  GB  RAM  memory 
•Two  36  GB  SCSI  hard  drives 
•10/100  Mgmt  Ethernet  interface 
•Two  Gig-E  Intel  interfaces 
•Running  Red  Hat  Linux 

•Commodity  Hardware 
•Commodity  OS 
•High  Availablity 
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Can  build  a  parallel  library  with  a 
few  messaging  primitives 
MatlabMPI  provides  this 
messaging  capability: 

M  PI  S  end  ( dest,  co  m  m,  tag ,  X) ; 

X  -  MPI Recv(source,com  m,tag); 
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Can  build  a  application  with  a  few 
parallel  structures  and  functions 
pMatlab  provides  Global  Array 
Semantic  via  parallel  arrays  and 
functions 

X  =  ones  (n,  mapX) ; 

Y  =  zeros  (n,  mapY) ; 

Y(:,:)  =ffit(X); 

Mil  Lincoln  Laboratory  “ 


MatlabMPI: 

Point-to-point  Communication 


*  Any  messaging  system  can  be  implemented  using  file  I/O 

*  File  I/O  provided  by  MATLAB  via  load  and  save  functions 

-  Takes  care  of  complicated  buffer  packing/unpacking  problem 

-  Allows  basic  functions  to  be  implemented  in  ~250  lines  of  MATLAB  code 
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•  Sender  saves  variable  in  Data  file,  then  creates  Lock  file 

•  Receiver  detects  Lock  file,  then  loads  Data  file 
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•  Transparent  interface  between  pMatlab  (MatlabMPI)  and  resource  mngr. 

•  User’s  computer  is  included  in  LLGrid  for  own  job  only 

•  Amasses  requested  resources  for  on-demand,  interactive  job  computation 

•  Handles  all  communication  with  the  Condor  resource  manager  (including 
submission  file,  launch  scripts,  and  job  aborts) 

•  User  never  interacts  with  queue  system  directly 
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LLGrid  Account  Creation 
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LLGrid  Account  Setup 

*  Go  to  Account  Request  web  page; 
Type  Badge  #,  Click  “Create  Account” 

*  Account  is  created  and  mounted  on 
user’s  computer 

*  Get  User  Setup  Script 

*  Run  User  Setup  Script 

*  User  runs  sample  job 


•  Account  Creation  Script 

(Run  on  LLGrid) 


•  User  Setup  Script 

(Run  on  User’s  Computer) 


Creates  account  on  gridsan 
Creates  NFS  &  SaMBa  mount  points 
Creates  cross-mount  communication  directories 
Mounts  gridsan 

Creates  SSH  keys  for  grid  resource  access 

Links  to  MatlabMPI,  pMatlab,  &  gridMatlab  source  toolboxes 

Links  to  MatlabMPI,  pMatlab,  &  gridMatlab  example  scripts 
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Account  Setup  Steps 


Typical  Supercomputing 

•  Account  application/ 
renewal  [months] 

•  Resource  discovery 

[hours] 

•  Resource  allocation 
application/renewal 

[months] 

•  Explicit  file 
upload/download 
(usually  ftp)  [minutes] 

•  Batch  queue 
configuration  [hours] 

•  Batch  queue  scripting 

[hours] 

•  Differences  between 
control  vs.  compute 
nodes  [hours] 


Site  Setup 

•  Secondary  storage 
configuration  [minutes] 

•  Secondary  storage 
Scripting  [minutes] 

•  Interactive  requesting 
mechanism  [days] 

•  Debugging  of 
example  programs 

[days] 

•  Documentation 
system  [hours] 

•  Machine  node  names 

[hours] 

•  GUI  launch 
mechanism  [minutes] 

•  Avoiding  user 
Contention  [years] 
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LLGrid  Account  Setup 

[minutes] 

•Go  to  Account  Request 
web  page;  Type  Badge  #, 
Click  “Create  Account” 

•Account  is  created  and 
mounted  on  user’s 
computer 

•Get  User  Setup  Script 
•Run  User  Setup  Script 
•User  runs  sample  job 
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MathWorks  Distributed  MATLAB  (DML) 


“Dear  MATLAB  user, 

“This  is  an  invitation  to  participate  in  an  upcoming  Beta  Test  for  the 
Distributed  MATLAB  product.  This  will  be  available  on  the 
following  platforms,  Win  2000,  Win  NT,  WIN  XP,  and  Linux. 

“The  goal  of  this  first  release  of  Distributed  MATLAB  is  to  address 
the  requirements  of  coarse-grain  applications,  in  which  the  same 
MATLAB  algorithm  is  executed  in  remote  MATLAB  sessions  on 
different  data  sets  without  communication  or  data  exchange 
between  sessions.” 

-  From  DML  beta  email 


Lincoln  has  installed  DML  and  is  testing  it  to  determine 
how  it  integrates  with  LLGrid  technologies. 
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LLGrid  On-Demand  Grid  Computing  System 

Agenda 


•  Introduction 

•  LLGrid  System 

•  Performance  Results _ 

•  LLGrid  Productivity  Analysis 

•  Summary 
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Performance:  Time  to  Parallelize 


Important  Considerations 


Description 

Serial  Code 
Time 

Time  to 
Parallelize 

Applications  that  Parallelization 
Enables 

Missile  &  Sensor  BMD  Sim.  (BMD) 

2000  hours 

8  hours 

Discrimination  simulations 

Higher  fidelity  radar  simulations 

First-principles  LADAR  Sim.  (Ladar) 

1300  hours 

1  hour 

Speckle  image  simulations 

Aimpoint  and  discrimination  studies 

Analytic  TOM  Leakage  Calc.  (Leak) 

40  hours 

0.4  hours 

More  complete  parameter  space  sim. 

Hercules  Metric  TOM  Code  (Here) 

900  hours 

0.75  hours 

Monte  carlo  simulations 

Coherent  laser  propagation  sim. 

(Laser) 

40  hours 

1  hour 

Reduce  simulation  run  time 

Polynomial  coefficient  approx.  (Coeff) 

700  hours 

8  hours 

Reduced  run-time  of  algorithm  ■ 

training 

Ground  motion  tracker  indicator 
computation  simulator  (GMTI) 

600  hours 

3  hours 

Reduce  evaluation  time  of  larger  data 
sets 

Automatic  target  recognition  (ATR) 

650  hours 

40  hours 

Ability  to  consider  more  target  i 

classes 

Ability  to  generate  more  scenarios 

Normal  Compositional  Model  for 
Hyper-spectral  Image  Analysis  (HSI) 

960  hours 

6  hours 

Larger  datasets  of  images  j 

MIT  Lincoln  Laboratory 
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pMatlab  Application  to  3D  Spatial 

Normalization 


A  Lincoln  group  is  developing 
normalization  algorithms  for  3D 
matched-field  (MFP)  beamformers 

Sponsored  by  DARPA-ATO  under 
Robust  Passive  Sonar  program 

Large  search  space  (  0(1  e7)  cells  ) 
makes  normalizer  evaluation  on 
processed  data  difficult 

pMatlab  code  enabled  rapid 
algorithm  development  and 
parameter  selection 

-  >  20x  speedup  by  exploiting 
parallelism  across  frequency  on 
nodes  of  Linux  cluster 

-  Development  time  was  ~1  day 


Simulated  data: 

Matched  field  output  at  target  depth,  bearing 


LLgrid-HPEC-04  -19 
AIR  29-Sep-04 


Range,  km 
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Normalized  Power,  dB  Normalized  Power,  dB 


LLGrid  On-Demand  Grid  Computing  System 

Agenda 


•  Introduction 

•  LLGrid  System 

•  Performance  Results 

•  LLGrid  Productivity  Analysis 

•  Summary 
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LLGrid  Productivity  Analysis  for  ROI* 


productivity  = 
(ROI) 


r 


v 


Utility 

J 


Software 

4. 

Maintenance 

+ 

System 

Cost 

V.  J 

■ 

Cost 

^  J 

■ 

Cost 

^  J 
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LLGrid  Productivity  Analysis  for  ROI* 


productivity  = 
(ROI) 


r 

time  saved  by  users  on  system 

V,  J 


time  to 

A 

time  to 

J. 

time  to 

J. 

time  to 

J. 

system 

parallelize 

T 

train 

^  J 

T 

launch 

^  J 

T 

admin. 

^  J 

T 

cost  , 

^  J 
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LLGrid  Productivity  Analysis  for  ROI 


productivity  = 
(ROI) 


r 


time  to 
train 

V  J 


r 


time  to 
launch 

V.  J 


r 


time  to 
admin. 

^  J 


r 


system 
cost 

V.  J 


Production  LLGrid  model  assumptions 


200  users  Lab-wide 
20  simultaneous  jobs 
Average  10  CPUs  per  job 


Training  time  -  4  hours  *  Lab-wide  users 


10,000  parallel  job  launches 
10  seconds  to  launch 


One  sys-admin  ~  2000  hours 

200  CPUs  @  $5k  per  node  ~  5000  hours 
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sal  aryl 

f  Total  time*] 
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I  time  to  1  I  User!  4 1  Number  of]  I  Time  to 
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LLGrid  Productivity  Analysis  for  ROI 


productivity  = 
(ROI) 
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time  to 
train 
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time  to 
launch 

V.  J 
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time  to 
admin. 
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system 
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Production  LLGrid  model  assumptions 


200  users  Lab-wide 
20  simultaneous  jobs 
Average  10  CPUs  per  job 


Training  time  -  4  hours  *  Lab-wide  users 


10,000  parallel  job  launches 
10  seconds  to  launch 


One  sys-admin  ~  2000  hours 

200  CPUs  @  $5k  per  node  ~  5000  hours 
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ROI  = 

rvv‘/l  expected 
36.000 

1000+27.8+2000+5000 

ROIexpected  ~  4.5 

Steady  state  with 
full  LLGrid 

*  Mileage  may  vary 
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LLGrid  Productivity  Analysis  for  ROI 


r/E 


productivity  = 
(ROI) 


time  to 
train 


time  to 
launch 


time  to 
admin. 


r 


system 
cost 

V.  J 


Production  LLGrid  model  assumptions 


200  users  Lab-wide 
20  simultaneous  jobs 
Average  10  CPUs 


4.5 


Training  time  -  4  hours  *  Lab-wide  users 


1.5 


10,000  parallel  job  launches 
10  seconds  to  launch 


One  sys-admin  *  2000  hours 

200  CPUs  @  $5k  per  node  *  5000  hours 


ROI*  a  2.6  -  4.6 


ATR  Laser  Coeff  Leak  HIS  GMTI  BMD  Here  Ladar 

Sample  Applications  (sorted  by  ROI) 

□  2  SLOCs/hr  B4  SLOCs/hr 


*  Varying  Mileage 

MIT  Lincoln  Laboratory  ^ 
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LLGrid  On-Demand  Grid  Computing  System 

Agenda 


•  Introduction 

•  LLGrid  System 

•  Performance  Results 

•  LLGrid  Productivity  Analysis 

•  Summary 
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Summary 
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A 


Users 


!  LLgrid  Alpha  Cluster] 

-I*  _  *“  | 


•  Easy  to  set  up 

•  Easy  to  use 

•  User’s  computer  transparently  becomes  part  of  LLGrid 

•  High  throughput  computation  system 

•  25  alpha  users,  expecting  200  users  Lab-wide 

•  Computing  jobs  they  could  not  do  before 

•  3600  CPU  days  of  computer  time  in  8  months 

•  LLGrid  Productivity  Analysis  -  ROI  =  4.5 
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